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ABSTRACT 

We extract the shape of the stellar velocity ellipsoid as a function of radius 
in M87 from van der Marel's (1994) velocity dispersion data. We include the 
gravitational force of a central black hole with the mass quoted by [Harms et 



al. (199^ . The kinematical data are corrected for the effects of seeing and 
instrumental blurring using a nonparametric algorithm. We find that the stellar 
motions in M87 are slightly radially anisotropic throughout the main body of 
the galaxy, with cr^ ~ 1.20"^. However at exceeds ar within the inner 1" — 2" 
by a statistically significant amount. A number of models for the formation of 
nuclear black holes predict a tangential anisotropy in the stellar motions, and 
our results provide evidence for such an effect in M87. 



1. Introduction 

Possibly no elliptical galaxy has been as thoroughly studied as M87 (NGC 4486) in 
the Virgo cluster. Early observations of the core of M87 revealed a velocity dispersion 
profile that rose toward the center, a feature that was initially attributed to the presence 
of a massive (~ 5 x lO^M©) black hole (Sargent et al. 1978). The same data were soon 
interpreted in other ways; for instance, [Binney fc IVlamon (1982)] demonstrated that models 



without a central point mass were equally acceptable if one was willing to let the stellar 
velocity ellipsoid become strongly prolate at ~ 100 pc from the center. Such extreme 
models were later shown to be dynamically unstable (Merritt 1987), but the instability 
led only to the formation of a mildly elongated bar, which - if viewed from a line-of-sight 
not far from its major axis - yielded nearly the same dependence of velocity dispersion 
on radius as the anisotropic spherical model. The most recent, stellar kinematical data 
(Dressier & Richstone 1990; Jarvis & Peletier 1991; van der Marel 1994) reveal that the 
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velocity dispersion fails to rise within the inner few arc seconds of M87. These data require 
even less anisotropy than the earlier, black-hole-free models. In fact, as emphasized by 
pressler fc Kichstone (1990)| , a ~ 4 x lO^M© black hole can only be reconciled with the 



velocity dispersion data if one assumes that the stellar motions near the center of M87 are 
nearly circular, a configuration that they found implausible. The stellar dynamical data 
thus seem to provide little if any evidence for a dark central mass. 



The case for a massive black hole in M87 is nevertheless quite strong. [Ford et aT 



(1994)1 and [Harms et al. (1994)| discovered a rapidly-rotating, ionized gas disk within the 



inner arc second of M87, with a major axis roughly perpendicular to the M87 jet. At a 
radius of ±0.22" = 16 pc along the major axis of the disk, the spectra show emission lines 
separated by 2V = 916 km s~^. If this gas is in approximately circular motion, and if the 
disk is inclined at 42° as implied by the apparent axial ratio, then M87 contains a central 
dark mass of Mh ~ 2.4 x IO^Mq (Harms et al. 1994). More recent HST observations at 
higher spatial resolution (Ford et al. 1996) appear to confirm this result. 

A nuclear black hole with this mass would be expected to affect the stellar motions 
within a radius Vh = GMh/cr"^ ^ 80 pc ~ 1" in M87, where a ~ 400 km s~^ is the 
stellar velocity dispersion. (We assume throughout a distance to M87 of 16 Mpc; thus 1" 
corresponds to 77.6 pc.) However no increase in the stellar velocities is seen in the inner 
few arc seconds; in fact the best ground-based data suggest a slight decrease in the velocity 
dispersion inside 1" — 2" (e.g. Fig. 1 of Jarvis & Peletier 1991). 

These apparently contradictory data can be reconciled if the stellar motions are 
appreciably circular within the radius of influence of the black hole. Such a model will 
not keep the observed velocity dispersions low at arbitrarily small radii - orbital velocities 
sufficiently close to the black hole must eventually rise according to Kepler's law - but a 
local bias toward circular motions at r ?a r/i can reduce the line-of-sight dispersions at ~ 1" 
by an amount that is sufficient to reproduce the stellar velocity dispersion measurements 
(Dressier & Richstone 1990). It is interesting that a number of models for the formation 
of massive black holes predict anisotropy in the stellar motions at radii close to GMh/a"^. 
For instance, the adiabatic growth of a black hole in a pre-existing galaxy core always 
produces a slight enhancement of circular motions at the expense of radial ones (Goodman 
& Binney 1984). An even stronger effect may be predicted by models in which black holes 
form by the coalescence of smaller black holes (Ebisuzaki, Makino & Okumura 1991); here 
the anisotropy results from the ejection of stars on elongated orbits due to three-body 
scattering by the black hole binary (Quinlan 1996). 

In the present paper, we re-analyze the [van der Marel (1994)[ velocity dispersion data, 
assuming the presence of a nuclear black hole with the mass quoted by [Harms et al.[ 
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(1994). We make use of the fact that - in a spherical system with a known gravitational 
potential - the shape of the stellar velocity ellipsoid at every radius follows uniquely from 
the observed velocity dispersion profile (Binney & Mamon 1982). In the case of M87, the 
stellar M/L (e. g. Richstone & Tremaine 1985) implies that the black hole dominates 
the gravitational force out to a radius of ~ 250 pc ^ 3". The stars also contribute to the 
gravitational force, of course, but their contribution can be reasonably approximated by 
assuming a constant mass-to-light ratio; M/L can then be derived from the virial theorem 
independent of any assumptions about the velocity anisotropy. Such a model is likely to be 
only approximately correct at large radii, where the dark matter surrounding M87 begins 
to affect the gravitational potential. But it is probably an excellent approximation at small 
radii where the influence of the black hole on the motions of the stars is significant. 

The same conceptual model has been used in many previous studies of the stellar 
dynamics of M87. Our treatment is new, however, in two ways. First, ours is the first study 
to to make use of the black hole mass as determined by Harms et al. (1994) and Ford 
et al. (1996). Thus we are able to specify a definite value for (although we consider 
also the effect of varying Mh by its estimated uncertainty). Second, our treatment of the 
data is more sophisticated than in previous studies. Since the dependence of the velocity 
anisotropy on radius, (3{r) = 1 — cr^(r)/(j^(r), is not known a priori, the goal should be to 
generate an estimate of this function directly from the data. In other words, one should 
approach the estimation of (3{r) as an inverse problem, not as a model-fitting problem. 
pinney fc Mamon (1982)] attempted to infer f3{r) directly from the data that were available 
at the time, but they chose to represent the velocity dispersion profile using an ad-hoc set of 
template curves, thus weakening the model-independence of their results. Binney & Mamon 
also did not take into account the possible effects of seeing and instrumental blurring on the 
velocity dispersion data. Van der Marel (1994) took the opposite, "model-fitting" approach, 
assuming a parametrized functional form for (3{r) and then predicting how a given model 
would appear after projection into observable space. However he did not consider any 
functional forms for (3{r) in which the stellar motions were circularly biased at small radii, 
as predicted by the black-hole formation models discussed above. 

The algorithm presented here combines the best features of both these studies. We 
ask simply: What smooth functions (3{r) are consistent with the measured set of velocity 
dispersions? This is an ill-posed problem, since many functional forms for (3{r) are likely 
to be consistent with the data after projection through the galaxy and after blurring by 
the atmosphere and the detector. We accordingly use a regularized algorithm to find the 
"most likely" expression for (3{r). The uncertainty in our estimate is itself estimated via a 
standard bootstrap algorithm; the ill-conditioning of the inverse problem manisfests itself 
most strongly as a widening of the confidence bands at small and large radii, where the data 
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do not strongly constrain the solutions. At no point do we make any assumptions about 
the functional forms of the observed profiles or of (3{r), except insofar as we require these 
functions to be continuous and smooth. Thus we achieve the model-independence aspired 
to by Binney fc IVlamon (1982) , while fully accounting for observational degradation of the 



data as in van der Marel (19941 



We find that a black hole with the mass determined by [Harms et al. (1994)| and [Ford 



et al. (1996)[ implies a substantial change in the shapes of the stellar orbits near the center 



of M87. In order to reproduce the low, central value of the velocity dispersion, the stellar 
orbits must become appreciably circular within the radius of influence of the black hole. 
The statistical significance of this result depends somewhat on the exact value assigned to 
Mh] the larger M^, the greater the tangential anisotropy implied by the velocity dispersion 
data. Such a result - although superficially implausible for a hot stellar system - deserves 
to be taken seriously in the light of black-hole formation models that predict a circular bias 
in the stellar orbits at r ^ r/j. 

The data, none of which are new, are summarized in §2. The nonparametric algorithm 
used to correct the velocity dispersion data for seeing and for instrumental blurring is 
presented in §3. The derivation of /5(r) is the subject of §4, and in §5 we interpret our 
results in the light of recent models for the formation of nuclear black holes. 



2. Data 

Only two observational quantities are needed for this study: the surface brightness 
profile of the starlight in M87, and the line-of-sight velocity dispersion profile of the 

stars, (jp{R). (We assume throughout that M87 is spherically symmetric.) The published 
sources from which our data were derived are as follows. 



HST I-band photometry of M87 for R < 20" were presented by [Lauer et al. (1992) . 



These data have an estimated spatial resolution of ~ 0.1". Between 20" and 163", 
ground-based, Rc band photometry of M87 was published by [Peletier et al. (1990)[ We 



converted the latter data to I-band via Rc — I = 1.153, the average color at these radii. 
The surface brightnesses were then corrected for galactic extinction using Aj = 0.04 (van 
der Marel 1994). 



Our velocity dispersion data were taken from [van der Marel (1994)[ , who used the 4.2m 



William Herschel Telescope in subarsecond seeing to obtain high S/N spectra of M87 within 
~ 25". We followed van der Marel's practice of using only the G-band data presented in his 
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Table CI. Measurements of dp at positive and negative radii were averaged and their quoted 
errors reduced using standard formulae for error propagation. These data are presented in 



3. Correction of the Data for Seeing and Instrumental Blurring 

The data consist of a set of measurements di at discrete positions Ri and their 
estimated errors e^. Our goal is to extract smooth representations of functions like (3{r) 
from these (noisy and incomplete) data. The relation between the data and the unknown 
function (call it u) has the implicit form 



where is a linear operator relating u to measured quantities at R = Ri, and Cj represents 
scatter of the data around their mean value at Ri, due in part to measurement errors. 

The operator C contains contributions from two sorts of effects: data degradation due 
to instrumental limitations, seeing, binning, etc; and projection of the intrinsic function 
u into observable space - for instance, projection onto the plane of the sky. To the 
extent that both effects are understood, at least in a statistical sense, we can write an 
expression for C and apply regularized inversion algorithms to solve for the most likely u 
(Merritt 1993a). Confidence bands on the function estimates can then be computed via the 
bootstrap (Efron 1982). The regularization is necessary since the inverse operation C^^ 
is typically ill-conditioned (O'SuUivan 1986), so that direct inversion of the data would 
produce unacceptably noisy results. 

Rather than derive a single expression for C that includes all of the relevant effects, 
we chose first to correct the data for seeing and instrumental blurring, and then to operate 
directly on the resulting smooth profiles via the inverse operator that represents spatial 
deprojection alone. This two-step procedure is perfectly justified from a statistical point 
of view (e.g. Wahba 1990, p. 19) and has the advantage that it allows us to separate the 
effects of instrumental blurring from the effects of spatial deprojection. 

The spatial resolution of the |Lauer et al. (199T)| HST surface photometry, ~ 0.1", is 
much greater than that of the velocity dispersion observations in the region R < 20" where 
the two data sets overlap. We can therefore ignore instrumental blurring in our estimate of 
the stellar luminosity density profile i/(r). We derived a nonparametric estimate of S(i?) by 
fitting a smoothing spline (Wahba 1990) to the logarithms of the photometric data, log Sj 



Fig. 1. 
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vs. log-Rj. The smoothing parameter was chosen by generahzed cross- vahdation (Wahba 
1990, p. 52). This estimate was then substituted into Abel's equation, 

1 dJ: dR 

to yield an estimate of i/(r). The result was found to be almost indistinguishable from the 
profile displayed by van der Marel (1994, Fig. 8), who used the same photometric data as 
us. We therefore do not reproduce our estimate of z/(r) here. We note in passing that M87 
- although still often described as having a "core" (e.g. Kormendy & Richstone 1995) - in 
fact has a luminosity density profile that increases as a nearly-perfect power law at small 
radii, u ~ r~^, r < 100 pc. This profile appears superficially core-like since an ~ density 
profile produces only a logarithmic divergence when seen in projection. 

In the case of the velocity dispersion measurements, which were made from the ground, 
the observed values might be strongly affected near the galaxy center by atmospheric seeing 
and by the finite spatial resolution of the detector. Here we generate, from the observed 
velocity dispersions a'p^Ri), an estimate of the true velocity dispersion profile ap{R) that 
would have been observed using a perfect detector in the absence of atmospheric blurring. 
This estimate will be used in the next section to generate estimates of the anisotropy profile. 

The true velocity dispersion profile ap{R) is related to the observed profile cr'p{R) via 

i:'{R)a'\R) = — dx dY s(Vx2 + r2) a/(Vx2 + r2) 

'ilW J —OO J —OO 



/■R+l rw r , 

/ dx dy PSF J{X - xY + {Y - 

J R—l J —w L 



(3) 



(e.g. Qian et al. 1995). Here PSF is the atmospheric point spread function; we adopt van 
der Marel's (1994) expression: 

PSF(i?) = Aie-^'/^'^i + ^26-^'/'"', (4) 

with 0-1 = 0.313", CTs = 0.751", Ai = 0.929, and A2 = 0.121. The inner integration is over 
the rectangle whose x-dimension 21 is the CCD pixel width, and whose y-dimension 2w is 
the slit width. These values were taken from Tables 1 and CI of [van der Marel (1994) ; 



beyond 3" from the center, data from more than one pixel were binned together, making / a 
function of position. The outer integration is over the 2-D image of the galaxy, assumed to 
be spherically symmetric. Finally, S'(-R) is the surface brightness profile as it would appear 
after convolution with the seeing disk and the detector. 

Interchanging the order of integration, we can write 

j:'{R)a'p\R) = dR' T.{R')ap^{R') dO K,{R, R' ,9), (5) 



k=l,2 
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erf 
erf 



'R'cosO - R + V 
R' sin 9 + 



erf 



'R' cos9- R-V 



— erf 



V2ak 
R' sin 9 + 



(6) 



where X = R' cos 9 and Y = R' sin 9. This relation can be discretized by writing 

Qi = Ti'{Ri)ap'{Ri) and hj = Tj{Rj)ap{Rj), where Ri and Rj are radial grid values. We then 

have 



n-l 



9i ~ 5Z ^ij^V 



(7) 



with 



Aij = I dR' d9 K{R„R', 

JR. Jo 



This is a "product integration" scheme, and the desired function h is obtained by inverting 
Eq. d^; hj is identified with the solution value at the midpoint of the interval {Rj,Rj^i). 

One could attempt to invert Eq. directly, but the result would be unacceptably 
noisy due to the ill-conditioning of the matrix A. (We in fact attempted direct inversion, 
with discouraging results.) Instead, we looked for the roughness-penalized function hj that 
minimizes 

' d'^ log CTp^ ^ 



J2 U-T.^iA] +^/ 



dR^ 



dR 



(9) 



for some appropriately-chosen A, the smoothing parameter. We represented the penalty 
function discretely as well, via the approximation 



cr. 



2a n • 



R,Y 



(10) 



The selection of the optimal smoothing parameter in ill-conditioned problems is, itself, an 
ill-conditioned problem (Wahba 1990, p. 105). An "infinitely smooth" estimate, i.e. the 
limiting estimate for large A, would be exponential in our case, logUp oc R. We estimated 
the optimal A by eye. Appreciably smaller values led to estimates that seemed unphysically 
noisy, while much larger values biased the solution toward the infinitely smooth, exponential 
dependence. For a fairly wide range of A's around our adopted value, however, there was 
no significant variation in the form of the estimated function. 

The results are shown in Fig. 1. The solid curve is our estimate of the corrected, 
line-of-sight velocity dispersion profile, ap{R) = \jh{R) /Tj{R). The dashed lines in Fig. 
1 are confidence bands on the estimate. These were constructed by first generating 300 
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bootstrap data sets (Tp{Ri) from the original data, according to the scheme: 

apiR,) = fiR,)+6i, (11) 

where / is a smoothing sphne fit to the uncorrected data and Si ~ A/'(0, e^) is a random 
number generated from the normal distribution with dispersion equal to the measurement 
error at Ri (Wahba 1990, p. 71). For each of these bootstrap samples, the deblurring 
algorithm just defined was used to estimate ap{R). The distribution of bootstrap estimates 
at every R was then used to construct the confidence bands in the figure. 

The corrected velocity dispersion profile in Fig. 1 follows the raw data outside of ~ 1". 
At smaller radii, the corrected profile rises slightly above the mean relation defined by the 
measured values. We would not expect the corrected profile to deviate strongly from the 
raw data even at these small radii, since the observed velocity dispersions are approximately 
constant within the inner two arc seconds, and a flat profile remains fiat after deblurring. 
However, the bootstrap confidence bands suggest that quite a wide range of profile shapes 
are consistent with the data inside of ~ 1", from a fiat or inwardly decreasing profile, to 
one that increases as far as ~ 450 km s~^ at 0.1" (95%). We note that the widening of 
the confidence bands at small radii is due primarily to the amplification of errors that 
accompanies the deconvolution; the measurement errors quoted by van der Marel (1994) 



are almost independent of radius, but their consequences become greater near the center 
due to the increased importance of the seeing and instrumental corrections there. 

The flatness of the velocity dispersion profile very near the center of M87 was noted by 
Dressier (1980)| , [Dressier fc Richstone (1990)| , [Jarvis fc Peletier (1991)| and [van der Marel) 



1994)1 in their respective data sets. This result appears therefore to be robust, although it 
could of course reflect some systematic error in the interpretation of the data. pressler"S| 
Richstone (1990)] emphasized the difficulty of reconciling a low central velocity dispersion 



with a massive {M^ > IO^Mq) nuclear black hole, unless the stellar motions become 
strongly anisotropic near the center. We will confirm their result below. 



4. Estimation of the Velocity Anisotropy 

In a nonrotating, spherical stellar system, the dependence of the velocity anisotropy 
f3 = 1 — o'^/a'^ on radius can be inferred uniquely from the observed velocity dispersion 
profile ap{R), if one is willing to assume that the functional form of the gravitational 
potential is known (Binney & Mamon 1982). We assume that the gravitational potential of 
M87 can be expressed as 

*M = _^ + f41U,M, ,12) 
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where is the mass of the central black hole, M/ L is the mass-to-light ratio of the stars, 
and $L is the "potential" corresponding to the stellar luminosity distribution: 



-AttG f - 



z/r dr + 



urdrj 



(13) 



with i/(r) the luminosity density. The expression ([T^) is not completely general, since it 
assumes that the stellar mass-to-light ratio is independent of radius. However we expect 



Eq. (12) to be sufficiently general out to a radius of several arc seconds where the black 
hole dominates the gravitational force. Relaxing the assumption of a constant M/L would 
make the derivation of (3{r) impossible without additional kinematical information (Merritt 
1993b). 



The stellar velocity dispersion in the radial direction is given by 



+ cos ^ 



j:{R)(rl{R) RdR 



2 

3r^ Jr 



,o 1 o\ , d$ , , 
2 J ^ ' dr' 



(14) 
(15) 



(Dejonghe & Merritt 1992, Eq. 57a), and the tangential component follows from the 
spherical Jeans equation: 



'd^ Idual^ 



dr 



(16) 



We can obtain a nonparametric estimate of crr(r) by substituting our smooth estimates of 
z/(r), S(i?) and <7p{R) into Eq. (p!5| ) (Wahba 1990, p. 19). However we ffist need to specify 
the parameters Mh and M/L that appear in our expression ([12|) for the potential. 



We ffist consider Mh- [Ford et al. (1994]| used HST WFPC-2 narrow-band images 
of M87 to find a 100 pc (~ 1") scale disk of ionized gas with a major axis oriented 
approximately perpendicular to the M87 jet. Parms et al. (199T)| used COSTAR and the 
FOS to measure the velocity at five positions in this disk and concluded that there was 
Keplerian rotation around a mass of (2.4 ± 0.7) X IO^Mq. More recent HST observations 
with a smaller, 0.086" aperture appear to con&m this interpretation, although the scatter 
in the velocities is still consistent with a fairly wide range of masses, from 1 to 3.5 x IO^Mq 
(Ford et al. 1996). This range is essentially equal to the ±2a interval quoted by [Harms et 



[al. (199?)[ . We will accordingly compute our kinematical solutions using the three values 
Mh = {1.0, 2.4, 3.8} X IO^Mq, corresponding to the [Harms et al. (1994)[ value plus or minus 
twice its estimated uncertainty. 

The second parameter, M/L, can be derived from the observed velocity dispersion 
profile independent of any assumptions about the stellar velocity anisotropy. The virial 
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theorem 



3 / EalR dR = 2 u—r^ dr (17) 
JO ^ JO dr 

(e. g. Dejonghe & Merritt 1992, Eq. 51) becomes, with our assumed form for $(r), 

M ^ 3 /o°° dR mial - 2^71^;, dr rv ^^^^ 
L 8ttG /o°° dr rv dr'r''^v ' 

which allows M/L to be determined from previously-computed quantities. (In order to 
evaluate the integrals in Eqs. ([T5|) and (|18D that extend to infinite radius, the velocity 
dipsersion profile was extrapolated as a power law beyond 25".) Our estimates of the I-band 
M/L in solar units are given in Table 1. Finally, confidence bands on M/L and on the 
function estimates were generated by repeating the set of computations just described for 
each of the 300, bootstrap-generated data sets defined above. 

Estimates of o"r.(r), crt(r) and (3{r) and their 95% confidence bands are shown in Fig. 
2. Fig. 2b gives results for Mh = 2.4 x IO^Mq, the value preferred by [Harms et al. (1994) 



while Figs. 2a and 2c illustrate how these function estimates change when M^ is increased 
or decreased by twice its estimated uncertainty of 0.7 x IO^Mq. The confidence bands 
become very wide at both large and small radii. At small radii, the effects of spatial 
deprojection and instrumental deblurring are greatest. At large radii, the lack of velocity 
data beyond ~ 25" implies a large variance in quantities like that formally depend on 
integrations to infinity. 

For Mh = 1.0 X IO^Mq, /?(r) is found to remain approximately constant with radius, 
(3{r) ^ 0.4 — 0.5, corresponding to crr/<^t ~ 1.3cr(. The stellar velocity ellipsoid is radially 
elongated everywhere in this case, although the 95% confidence bands are consistent with 
motions that are close to isotropic. The need for radially-biased motions when the black 
hole mass is small or zero has been noted by a number of authors including [Binney ^ 
Mamon (19821 , [Dressier fc Richstone (19901 and [van der Marel (1994)[ . 



However when M^ is increased to its preferred value of 2.4 x lO^M©, /3(r) is forced to 
decline at small radii in order to maintain the observed, low value of the central velocity 
dispersion. The central decline in /?(r) becomes even more extreme if Mh is increased to 
3.8 X 1O^M0 - in this case, the inferred value of actually falls below zero inside of ~ 0.4" 
(Fig. 2c). A similar conclusion was reached by pressler fc Richstone (1990)[ , who used an 



orbit-based code to infer the stellar kinematics in M87 assuming M^ = 3.6 x 10^M( 



0- 



Although our best estimates of /5(r) suggest a rapidly increasing anisotropy at small 
radii, the confidence bands on f3 become very wide at small r, and it is not obvious from 
Fig. 2 whether the central decline in (3 that we find using the two larger values of Mh 
is statistically significant. One way to address this question is to ask: What fraction of 
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the bootstrap estimates have a lower value of (3 at 0.5" (say) than at 2"? We find that 
the decrease in (3 between 2" and 0.5" is significant at the 98% level for = 2.4 and 
3.8 X IO^Mq, and at the 43% level for Mh = 1.0 x lO^M©. If |bbrd et al. (1996)| are correct 
in speculating that turbulent motions in the M87 gas disk might imply an even larger mass 
than 3.8 x IO^Mq, the central decline in (3{r) would become even more significant. 



5. Discussion 

Our conclusion that is approximately equal to, or slightly greater than, at throughout 
the main body of M87 is unremarkable. However the apparent predominance of circular 
over radial motions at the very center of M87 is potentially more interesting. A number 
of models for the formation of nuclear black holes predict a bias toward circular motions 
within the radius of infiuence of the black hole, = GM^/a'^, or ~ 1" in M87. This is 
roughly the radius at which we find that a,, must fall below at when > 1.0 x IO^Mq. 
(As often noted, Vh in M87 is also roughly equal to the size of the seeing disk, a coincidence 
that is hopefully fortuitous!) Thus it seems appropriate to ask whether the anisotropy that 
we infer in the stellar motions could be a relic of the black hole formation process. 

In one widely-discussed class of models, black holes grow from the accumulation of gas 
on a timescale long enough that the stellar action variables are adiabatically conserved. The 
gradual deepening of the gravitational potential in these models can induce a tangential 
anisotropy in the stellar motions at radii near Vh- [Young (1980)| first noticed this effect in 



numerical calculations, and [Goodman fc Binney (198T]| showed that the adiabatic growth of 



a black hole at the center of an initially isothermal galaxy always enhances the tangential 
components of the velocity dispersion at the expense of the radial component. However 
the effect is subtle, since the eccentricities of orbits in axisymmetric systems are almost 
unaffected by slow changes in the potential (Lynden-Bell 1963). [Cjuinlan, Hernquist fc 



[Sigurdsson (1995][ followed the adiabatic growth of black holes in a variety of spherical. 



initially isotropic galaxy models, with and without constant-density cores. The resulting 
/3(r) profiles varied from model to model, but the value of (3 ai r = Vh was always close to 
its minimum, with Pmin typically lying between —0.3 and 0. Such mild anisotropics would 
be almost undetectable in M87 given the quality of data currently available. 

Black holes might also grow by the capture of stars on low angular momentum orbits, 
whose pericenters lie within the tidal breakup radius rt of the black hole (Hills 1975). One 
result would be to remove elongated orbits from the central regions of the galaxy and hence 
to increase the ratio of at to cJj. (e.g. Cohn & Kulsrud 1978, Fig. 8). However the tidal 
radius of a ~ IO^Mq black hole is comparable to its Schwarzschild radius, or ~ lO"'^ pc 
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in the case of the M87 black hole - much too small to be observed. Scattering of stars at 
larger radii into the loss cone (Frank & Rees 1976) would be unimportant due to the long 
relaxation time in M87. 

In yet another class of models, nuclear black holes grow through the accretion of other 
black holes acquired from galaxy mergers (e.g. Ebisuzaki, Makino & Okumura 1991). 
Dynamical friction drags the black holes into the central region where they form a bound 
pair. The subsequent evolution was outlined by [Begelman, Blandford fc Rees (1990)| : the 



binary separation shrinks with time — at first through dynamical friction, later because of 
three-body scattering processes in which the hard binary ejects stars from the center — until 
it becomes small enough that gravitational radiation causes the black holes to merge. The 
mass of stars ejected by the binary can be comparable to the sum of the two black hole 
masses (Quinlan 1996). Because the binary interacts most strongly with stars on elongated 
orbits, three-body scattering should introduce a tangential bias in the velocity distribution 
of the remaining stars. Detailed simulations of this complicated process have yet to be 
published, but it seems plausible that the induced anisotropy could be much greater than 
in the adiabatic-growth or tidal-disruption models. 

Although our conclusions about the stellar dynamics of M87 are qualitatively consistent 
with the predictions of all these models, we stress that the case for a massive black hole 
is not strengthened in any way by our work. The low, central value of the stellar velocity 
dispersion in M87 continues to be surprising given the independent evidence for a dark 
mass. If a massive black hole is indeed present in M87, then nature has somehow conspired 
to hide its effects on the stellar kinematics by adjusting the shapes of the stellar orbits. 
As discussed above, such a conspiracy is not impossible to imagine, but one would still 
like to find evidence in the stellar kinematics for the central singularity. The bias toward 
circular orbits that we infer at ~ 1" is not sufficient to keep the observed dispersions low at 
arbitrarily small radii; eventually, even exactly circular motions would be expected to show 
a Keplerian rise. Observations of M87 with the Space Telescope Imaging Spectrograph 
(Baum et al. 1996) should allow accurate measurement of the stellar velocity dispersion at 
a radius of 0.2" ^ 16 pc, where the orbital velocity around a 2.4 X IO^Mq black hole would 
be ~ 800 km s^^. Such high velocities would leave their mark on the line-of-sight velocity 
dispersion profile regardless of the shapes of the stellar orbits. 



We thank R. van der Marel for assisting in the data interpretation, and G. Quinlan 
for helping us to understand the various models for black hole formation. This work was 
supported by NSF grant AST 90-16515 and NASA grant NAG 5-2803 to DM. 
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Table 1: M87 Stellar Mass-To-Light Ratio 



Mu/Mq {M/L)i 68% 95% 

1.0 X 10^ 3.76 (2.94,5.96) (2.33.9.44) 

2.4 xlO^ 3.72 (2.89,5.91) (2.29,9.39) 

3.8 xlO'^ 3.68 (2.86,5.88) (2.26,9.31) 
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Fig. 1.- 

Line-of-sight velocity dispersions in M87. Open circles: G-band measurements from 
van der Marel (1994)| . The innermost data point has been shifted from its true position 
at R = 0". Solid line: velocity dispersion profile corrected for the effects of seeing and 
instrumental blurring. Dashed lines: 90%, 95% and 99% confidence bands on the corrected 
profile. 

Fig. 2.- 

Velocity dispersions and anisotropy as a function of radius in M87, for three different 
values of the black hole mass: Mh = 1.0 x lO^M© (a), 2.4 x IO^Mq (b) and 3.8 x lO^M© (c). 
Left hand column: radial (heavy curves) and transverse (thin curves) velocity dispersions; 
dashed curves are 95% confidence bands. Right hand column: velocity anisotropy, 
P = 1 — a^/a"^, with 95% confidence bands. 
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Fig. 1.- 
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